Dichotomizing continuous predictors in multiple regression: a bad idea.

نویسندگان

  • Patrick Royston
  • Douglas G Altman
  • Willi Sauerbrei
چکیده

In medical research, continuous variables are often converted into categorical variables by grouping values into two or more categories. We consider in detail issues pertaining to creating just two groups, a common approach in clinical research. We argue that the simplicity achieved is gained at a cost; dichotomization may create rather than avoid problems, notably a considerable loss of power and residual confounding. In addition, the use of a data-derived 'optimal' cutpoint leads to serious bias. We illustrate the impact of dichotomization of continuous predictor variables using as a detailed case study a randomized trial in primary biliary cirrhosis. Dichotomization of continuous data is unnecessary for statistical analysis and in particular should not be applied to explanatory variables in regression models.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

EFFECT OF DICHOTOMIZING CONTINUOUS VARIABLES IN REGRESSION MODELS by

Jose Francisco Cumsille. Effect of Dichotomizing Continuous Variables in Regression Models (Under the Direction of Dr. Shrikant I. Bangdiwala). The dichotomization of continuous variables is a very common practice when people analyze data. Some of the consequences of such dichotomization are well known: loss of information, grouping people in the same group when they are different, loss of powe...

متن کامل

Dichotomizing rating scale scores in psychiatry: a bad idea?

In psychiatry, the use of rating scales as measures of outcome in clinical trials allows us to generate continuous outcome data, where each individual's outcome is measured in numbers. Continuous outcomes can be divided into two categories, such as improved and not improved, or may be kept continuous. This article briefly presents the main advantages and disadvantages of these two approaches, w...

متن کامل

A comparison of two methods for estimating odds ratios: Results from the National Health Survey

BACKGROUND The practice of dichotomizing a continuous outcome variable does not make use of within-category information. That means the loss of information. This study compared two approaches in the modelling of the association between sociodemographic and smoking with obesity in adult women in Iran. METHODS We conducted a comparative study between two methods via an illustrative example, usi...

متن کامل

Mindfulness and Its Predictors in Women With Polycystic Ovary Syndrome

Background: Polycystic Ovary Syndrome (PCOS) is the most common endocrine disorder in women of reproductive age which can cause many problems such as hyperandrogenic symptoms and fertility problems. Objective: The present study aimed to determine the relationship of mindfulness with hyperandrogenic symptoms and demographic and fertility factors in women with PCOS. Methods: This descriptive co...

متن کامل

Multiple linear regression: accounting for multiple simultaneous determinants of a continuous dependent variable.

In many cardiovascular experiments and observational studies, multiple variables are measured and then analyzed and interpreted to provide biomedical insights. When these data lend themselves to analyzing the association of a continuous dependent (or response) variable to 2 or more independent (or predictor) variables, multiple regression methods are appropriate. Multiple regression differs fro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Statistics in medicine

دوره 25 1  شماره 

صفحات  -

تاریخ انتشار 2006